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Abstract 

Objective probabilistic forecasts of future climate that include parameter uncertainty can be 
made by using the Bayesian prediction integral with the prior set to Jeffreys' Prior. The calculations 
^S) , involved in determining the prior can then be simplified by making parametric assumptions about 

the distribution of the output from the climate model. The most obvious assumption to make is 
that the climate model output is normally distributed, in which case evaluating the prior becomes 
a question of evaluating gradients in the parameters of the normal distribution. In previous work 
we have considered the special cases of diagonal (but not constant) covariance matrix, and constant 
O I (but not diagonal) covariance matrix. We now derive expressions for the general multivariate normal 

^ . distribution, with non-constant non-diagonal covariance matrix. The algebraic manipulation required 

C/3 ' is more complex than for the special cases, and involves some slightly esoteric matrix operations 

y , including taking the expectation of a vector quadratic form and differentiating the determinants, 

C/5 ■ traces and inverses of matrices. 

>^' 

Pi- 1 Introduction 

We are interested in using climate models to produce objective probabilistic forecasts of future climate 

^ ' that include parameter uncertainty. The word 'objective' is used here in a technical statistical sense 

t^^ , that means that the prior distribution for the parameters is determined by a rule, rat her than from 

^=^ ' intuition. In statistics the most widely discussed rule is the Jeffreys' rule ( JefFreva . ll946l ). Jeffreys' rule 






could, in principal, be applied to a climate model directly from the definition. This would translate 
into using numerical methods to differentiate the predicted probabilities from initial condition ensembles 
and to take expectations over all simulated climate states. That could, however, be computationally 



f— ^ ' demanding. As an alternative one could fit distributions to the output from the model, and differentiate 

the estimated parameters of the distributions instead, which is likely to be considerably easier. The 
most obvious distribution to fit is then the multiva riate normal distribution. In previous work we have 
considered two special cases. In I Jewson et al.l ( 20091) we considered the case where the predicted variables 



^\f ' are independent, but both the mean and the variance of initi al condition ensemb les from the model 

^ , are allowed to vary as a function of the model parameters. In I Jewson et al.l ( 20101 ) we considered the 



complementary case where the predicted variables can be correlated, but with a covariance matrix that 
is constant as a function of the model parameters. We now consider the general case, with correlated 
predicted variables and a covariance matrix that can vary as a function of the model parameters. This 
general case contains the two special cases. The algebra in this case is not quite as elementary as the 
two special cases, in that we have to take expectations of a vector quadratic form, and differentiate the 
determinants, traces and inverses of matrices. Pedagogically, therefore, we consider the derivations used 
in the two simpler cases as still being useful, especially as climate modelling practice is perhaps unlikely 
to reach the stage where it would be necessary to consider the covariance matrix varying as a function of 
parameters for quite some time (there are many other more important challenges t o be dealt with first) . 
Detailed explanations of Jeffreys' Prior, and the motivation for its use, are given in I Jewson et al.l ( 20091 ) 



and will not be repeated here. In section [2] below we give the expressions for Jeffreys' Prior and the mul- 
tivariate normal density. In sectionjSjwe then derive the expression for Jeffreys' Prior for the multivariate 
normal distribution where the climate model has just a single parameter. In section|4|we derive the same 
expression, but considering multiple parameters. In both se ction |5| and se c tion Hj we also consider four 



special cases: independence (ta king us back to the r esults in I Jewson et al.l (|2009l )). constant covariance 



(taking us back to the results in I Jewson et al.l (2010|)), constant correlation and constant variance 
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2 Jeffreys' Prior and the Multivariate Normal Density 

We start with definitions of tlie Jeffreys' Prior and tfie multivariate normal density. 

2.1 Jeffreys' Prior 

Jeffreys' Prior is given by: 



p{0) == Jdet 



^,'d'^\ogp{x\t 



dO^dOk 



(1) 



D etailed explanations of t his equation, which can at first be rather difficult to understan d, are give n 
in lJewson et alj (|2009l ) and lJewson et al.l (|2010l ). as well as many statistics textbooks such as lLed (|l997l ). 



2.2 The Multivariate Normal Density 

Probability densities from the multivariate normal distribution are given by: 
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where, in our application: 



• x is a vector of those variables predicted by the climate model that are to be compared with 
observations 

• 9 is a, vector for the underlying parameters in the climate model 

• fi—^{9) is a vector of the mean response of the model (in other words, the ensemble mean of x for 
an infinite-sized initial condition ensemble for fixed parameters 6) 

• S = S(0) is the covariance matrix of the response of the model (in other words, the ensemble 
covariance matrix of x for an infinite-sized initial condition ensemble for fixed parameters 9) 

• S — E^^ = S{9) is the inverse of the covariance matrix 

• D = det(S) ~ 0(9) is the determinant of the covariance matrix 
This gives: 

n, „ 1 , „ 1 



■yx - fi^ 
n, ^ 1 . „ 1 



lnp{x\9) = ln2Tr- -\nD - -{x- nY S{x- n) (3) 



- In 27r - - In Z) - -(x^ S-x - x^ S^I -- n^ Sx + fx^ S^) (4) 

Since S is symmetric we have x'^ S/.! = ^i^ Sx which means that the above expression for \np{x\9) simplifies 
a little to: 

\np{x\9) = -'^\n2TT-^\nD-hx^Sx-2x'^SiJi + ^FSii) (5) 

We now consider two cases: one parameter and multiple parameters. 

3 One parameter 

The first case we consider is where there is just a single parameter in the climate model. We mainly 
consider this case as a warm-up for the multiple parameter case, although it would also be relevant if one 
only wanted to model the uncertainty due to a single parameter, which might be a good approximation 
to the overall parameter uncertainty if that single parameter dominates the uncertainty. 
If we consider 9 to be this single (scalar) parameter, then differentiating equation [5] by 9 gives: 
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Expanding the third and fourth terms: 
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Since S* is symmetric ^^ S-p^ — -^Sji and so the 5th and 7th terms combine, giving 
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Differentiating again wrt 9 gives: 
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(where we have expanded the derivatives of products) 
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(where we have combined the 4th and 5th, and 8th, 10th and 12th terms) 
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Taking expectations: 
'92 lnp(x|i 



E 



89^ 



18^lnD 
2 89^ 



, 8^S 
89^ 



-E{x^)S 



^E ( x'^^x ) + i?(x^)^M + 2E{x'^)^^ 

9612' 



8 fi 8 fi 



2,,T 



89^ 89^ 



-5/i 



dfi^dS_ 



925 
9612' 

8fi'^ ^8^1 



-S- 



1 7,925 



(18) 
(19) 



But E{x) ~ /i, and, from appendix 2 
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and so: 
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(using a standard result for the derivative of a determinant known as Jacobi's 
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(using a standard result for the derivative of a trace) 
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(using the linearity of the trace operator) 
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Now note that 
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Returning to equation 1251 and substituting in the expression given in equation 1411 gives: 
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This gives the prior; 
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If there are n observations then: 

• /i is an n X 1 vector 

• -g^ is an n by 1 vector 

• /i"^ is a 1 X n vector 

• ^^^ is a 1 by n vector 

• S" is an n by n matrix 

• p,n J ~aa 1^ ^ scalar 

• ^ is an n by n matrix 

• ^ is an n by n matrix 

• 4^ ^ is an n by n matrix 



tr(^^) is a scalar 
We now consider vari ous s pecial cases of this general formula, starting with the two cases described 



w e now consiaer vari ous s pecial cases o i tnis 
in Ijewson et al.l (|2009l ) and Ijewson et alJ POIOI ) 



3.1 Independence 

If the observations are modelled as independent then: 
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(50) 
(51) 
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(53) 
(54) 
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and so the prior is 

p{9) 



which agrees with equation 19 in I Jewson et all (|2009l ). 
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3.2 Constant Covariance 

If the covariance E is constant then equation 25] reduces immediately to 



m ^ r4s% m 



which agrees with equation 19 in I Jewson et all ( 2010r ). 



3.3 Constant Correlation 

If the correlations are modelled as constant (but the variances are allowed to vary) then: 

E = VCV (60) 

where y = diag(cr?, a^, ..., cr^) (61) 

and C = the correlation matrix (62) 

dV 

, dV dai dc72 9cr„ 

where— = 2dmg(ai — , a, — , ...,a„ — ) (65) 

S = V-^C-^V-^ (66) 

where V^^^ = diag(c^^^ CTa^^ ..., ct^^) (67) 

and C^ = the inverse correlation matrix 
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^^ere^j^ = -2diag ( -^ — -, -^ ^j^, ..., — ^ ) (71) 



^^ = AV-^C-^'-^'-^CV (72) 
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and the prior is: 



3.4 Constant Variance 

If the variances are modelled as constant (but the correlations are allowed to vary) then: 

E = VCV (75) 

where y = diag(CT^ a|, ..., fx^) (76) 

and C — the correlation matrix (77) 



^ = V'-^V (78) 

s = y-^c-^y-^ (79) 

where y^^ = diag(crf ^ c^2"^ ..., cr,;^) (80) 

and C~^ = the inverse correlation matrix (81) 

^ = y-^^y- (82) 
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and so the prior is: 



^(^) ^ l¥4-^(-^^S-) 
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\l 80 80 2 \ 80 80 

4 Multiple Parameters 

We now consider the muhiparameter case. We start with two parameters and generahse to multiple 
parameters later. The derivations are only slightly more complex than those for the single parameter 
case. Starting from equation [9l which was: 

8\np{x\0) l8\nD 1 j.8S rr.8S ^8^1 8fi'^ ^ 1 ^88 , , 

we now take the derivative wrt a second parameter (p: 
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Taking expectations: 
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But E{x) — II, and, from appendix 2, 
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Giving: 
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Substituting expression 11201 into equation 11041 gives: 
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We now generalize from two to multiple parameters. We change the notation so that 9 is the vector of 
all parameters. The prior is then: 
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If there are n observations and m parameters then: 
• /i is an n X 1 vector 
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• /i"^ is a 1 X n vector 
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S is an n by n matrix 
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W e now again consider various special cases of this general formula, starting with the two cases described 
in IJewson et al.l (|2009t) and Jewson et al.i (,20101) . 



4.1 Independence 

If the observations are modelled as independent then: 
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and so 
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which agrees with equation 36 in Ijewson et al.l ( 20091 ) . 
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4.2 Constant Covariance 

If the covariance S is constant then equation 11251 reduces immediately to 
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which agrees with equation 28 in I Jewson et al.l (2010|). 
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4.3 Constant Correlation 

If the correlations are modelled as constant (but the variances are allowed to vary) then: 
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and the prior is: 
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4.4 Constant Variance 

If the variances are modelled as constant (but the correlations are allowed to vary) then: 

E = VCV 
where y = diag(cr^, cr^, ...,ct^) 
and C — the correlation matrix 

as 

Wk 

s = v-^c-W-^ 
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where V^ ~ diag(o-j^ ^, ctj ^, ..., cr„^) 
and C— 1 = the inverse correlation matrix 
dS 



89, 

ds as 
WiWk 



v-^'-^v-^ 



V 



_^dC 
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^Vy-I^y 
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and the prior is: 



*l^) -~ Hw'M 



1 / , ,ac-i 8c ^^ 

-tr T/-1 V 

• ' ae, 89k 



Me.,^5l^.i..-(v-.C-.fc-.f. 



a^j a^fe 



(163) 
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5 Summary 

Climate models are statistical models, in that they produce probabilistic predictions (when run as initial 
condition ensembles) and have certain parameters that can only be determined by comparison of model 
results with observations. As a result, the Bayesian framework, in which probabilistic predictions made 
from models with different parameter values are combined together to make a single best probabilistic 
prediction, can be applied. Within that framework one has to specify a prior, and one must choose 
between a prior based on intuition or a prior based on a rule. The former is known as subjective Bayesian 
statistics, and the latter, objective Bayesian statistics. The authors are pursuing a research programme 
that is exploring methods by which objective Bayesian statistics can be applied in climate modelling. In 
this article we have discussed the application of the most standard objective prior, known as Jeffreys' 
Prior. 



Climate models are complex and the relationship between the parameters and the predicted distributions 
is also complex. However, the form of the predicted distributions themselves can often be rather simple, 
and for many groups of variables a multivariate normal may be a good approximation. We have shown 
that, by making this approximation, the calculation of Jeffreys' Prior can be reduced to differentiating 
the parameters of the multivariate normal by the parameters of the underlying climate model. We derive 
expressions for Jeffreys' Prior in this situation. 

The results from our two previous articles on this topic (jjewson et al.l ()2009[ ) and I Jewson et al.l (J2010I )) 
are special cases of the general results shown here. 

In all this work we have expressed Jeffreys' Prior in terms of the true, rather than the estimated, pa- 
rameters of the distributions from climate model predictions. In other words, we have assumed infinite 
rather than finite size initial condition ensembles. A further challenge is to rederive the expressions given 
above but incorporating estimation uncertainty. 

A Proof that if S is symmetric then a^Sb = b^Sa 

Since a^ Sb is a scalar it is equal to its tranpose, as so: 

a'^Sb = {a'^Sbf = b^S'^a (165) 



but if S is symmetric then 
Putting these two together gives: 



b'^S'^a = b^Sa (166) 

JSb = b^Sa (167) 



B Proof that E{x^Ax) = tr {T.A) + fF A^x 

E{x'^Ax) = ^^x,A,jXj (168) 

= e{y.x,x,A,A (169) 

= Y,E{x,x.j)A,j (170) 



But, by definition: 



So 



and so 



Ey = E{{x, - n,){x.j ~ n-j)) (171) 

= E{xiXj - Xifij - ^iiXj + ^li^iJ) (172) 

= E{xiXj) - E{xi)fij - ^iE{xj) + ^tfij) (173) 

= E{xiXj) - ^j^j - ^j^j + ^j^j) (174) 

= E{xiXj) - Htfij (175) 



E{xiXj) = T,ij + ^ifij (176) 

E{x^Ax) ^ '^{Y.^j + n^Hj)A,j (177) 

= ^^J^ijAij +^^HiHjaij (178) 

ij ij 

= tr (SA) + ^^A/i (179) 

= tr (AY.) + fi'^Afi (180) 
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